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Abstract 

In wireless communication systems, dual-polarized (DP) instead of single-polarized (SP) multiple- 
input multiple-output (MIMO) transmission is used to improve the spectral efficiency under certain 
conditions on the channel and the signal-to-noise ratio (SNR). In order to identify these conditions, 
we first propose a novel channel model for DP mobile Ricean MIMO channels for which statistical 
channel parameters are readily obtained from a moment-based channel decomposition. Second, we 
derive an approximation of the mutual information (MI), which can be expressed as a function of those 
statistical channel parameters. Based on this approximation, we characterize the required SNR for a DP 
MIMO system to outperform an SP MIMO system in terms of the MI. Finally, we apply our results to 
channel measurements at 2.53 GHz. We find that, using the proposed channel decomposition and the 
approximation of the MI, we are able to reproduce the (practically relevant) SNR values above which 
DP MIMO systems outperform SP MIMO systems. 
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I. Introduction 

Multiple-input multiple-output (MIMO) transmission is by now a well established technique 
to enhance the spectral efficiency over wireless channels. While commonly antennas with the 
same polarization are considered for MIMO systems, the use of dual-polarized (DP) antennas 
is known to offer advantages in terms of the spectral efficiency under certain conditions on 
the channel and the signal-to-noise ratio (SNR). Besides being able to improve the spectral 
efficiency, DP antennas allow for compact MIMO systems with co-located antennas due to the 
strong decorrelation over orthogonal polarizations. 

In order to understand the influence of channel properties and the SNR on the spectral 
efficiency, channel models are commonly used. The main goal of channel models is to give 
a simplified yet accurate representation of the effects of the channel on the transmitted signal. 
They thus allow to replace the use of sophisticated channel measurements that are specific 
to a measurement environment, and, furthermore, they can allow for analytical evaluations. A 
good overview on the modeling of DP MIMO channels can be found in [[Il-lEl. Experimental 
results regarding DP MIMO channels are presented in, e.g., [|2l, [|4l-[|6l. Furthermore, in 0, the 
orthogonality of DP MIMO channels is characterized, and, in [[8l, the impact of Ricean fading 
channels on the diversity performance is investigated analytically. 

Unfortunately, an accurate and analytically tractable modeling of DP MIMO channels is a 
difficult task. One has to resort to several assumptions in order to obtain analytical expressions, 
e.g., for the mutual information (MI), and thus to assess the influence of the channel on the 
spectral efficiency. It is known that DP MIMO systems are attractive in Ricean channels [|2]|, [l9]|. 
However, the channel and the SNR conditions for a DP MIMO system to outperform a single- 
polarized (SP) MIMO system in terms of the spectral efficiency are not fully characterized and 
they are time-dependent. Expressions relating the statistical channel parameters to the spectral 
efficiency are usually limited to restrictive channel models with separable correlation, i.e., a 
Kronecker structure, and/or without a Ricean component; moreover, they often rely on asymptotic 
settings. For recent contributions regarding analytical expressions of the MI for Ricean channels 
in asymptotic settings, see [fTOl and references therein. The dependence of the spectral efficiency 
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of SP and DP MIMO channels on the SNR and the i^-factor is demonstrated, e.g., in [Ij 
with simulated channels. The spectral efficiency of measured SP and DP MIMO channels with 
(instantaneous) channel state information (CSI) at the receiver (RX) only has been compared, e.g., 
in [[TT]| with indoor measurements at 2.4 GHz, in [[T2| with indoor measurements at 1.95 GHz, 
or in [ini with outdoor measurements at 2.5 GHz. While IHl conclude that DP MIMO 
systems are favorable, [fTTI concludes that, especially for low K-factors, SP MIMO systems are 
recommended to reach higher spectral efficiencies. Therefore, as highlighted in [j4j, it is not 
straightforward to decide when to use a DP instead of an SP MIMO system. We also note that 
SP MIMO systems would highly benefit from the availability of CSI at the transmitter (TX). 

Consequently, we first aim at establishing a general channel model for SP and DP MIMO 
systems which is reasonably accurate, yet analytically tractable. Second, we aim at identifying 
the conditions on the channel and the SNR under which it is beneficial, in terms of spectral 
efficiency, to make use of the polarization domain for a limited number of antennas at both link 
ends. The reason to limit the number of simultaneously used antennas is that it is desirable to 
keep a low number of radio frequency chains since they are expensive components in a wireless 
system. One can then perform antenna switching between differently polarized antennas, i.e., 
between SP and DP MIMO systems. 

Contributions: We detail a general modeling approach for SP and DP MIMO channels. 
Furthermore, we evaluate the achievable rate over such channels for the case that the TX has only 
statistical CSI, while the RX has instantaneous CSI. In particular, we contribute the following: 

• We propose a general model for SP and DP mobile Ricean MIMO channels. Furthermore, 
we derive a moment-based channel decomposition yielding the statistical channel model 
parameters from measured data. 

• We give an approximation of the achievable rate, i.e., the MI, which is an explicit function 
of the statistical parameters of the proposed channel model. We can thus assess the influence 
of the statistical channel parameters on the achievable rate. 

• We use the approximate MI to characterize the required SNR for a DP setup to outperform 
an SP setup. Specifically, we give a closed-form expression of such an SNR threshold for 
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the practically relevant case of a dual-stream DP setup vs. a single-stream SP setup. 
• We evaluate the channel decomposition and the MI for 4x4 SP and DP MIMO systems based 
on urban macrocell measurements at 2.53 GHz. We find that the DP setup is advantageous 
in terms of the MI for medium- to high- ii'-f actor links above a certain SNR. With the 
approximate evaluation of the MI, we can reproduce the crossing points between the MI of 
the SP and DP MIMO systems. 
Structure: We first introduce the MIMO system model in Section [III Then, in Section Hill 
we develop the channel model and its corresponding decomposition technique for SP and DP 
channels. Section |IV] deals with the performance assessment for SP and DP MIMO transmission. 
In Section |Vl the channel measurements and the data selection are presented, before proceeding 
with the results in Section |Vll Finally, we draw the conclusion in Section IVIII 

Notation: We use lowercase and uppercase boldface letters to designate vectors and matrices, 
respectively. For a matrix A, the (element- wise) complex conjugate, the transpose, and the 
conjugate transpose are denoted by A*, A^, and A^, respectively. The unique Hermitian positive 
semidefinite square root of a Hermitian positive semidefinite matrix A is represented by A^. 
For the matrix A, tr {A}, rank {A}, and Amax(A) denote the trace, the rank, and the maximal 
eigenvalue, respectively. For two matrices A and B, A B is the Hadamard (element-wise) 
product and A B is the Kronecker product. The vectorization, i.e., the column-wise stacking, 
of the matrix A is denoted by vec {A}. The N x N identity matrix is represented by I at and the 
all-zero matrix of size A^^i x N2 is denoted by Oat^ ^Tj. The real- valued MN x MN commutation 
matrix K.m,n satisfies }^m,n vec {A} = vec { A^} for an M x matrix A. Consider an M x 
matrix A with k = 1, . . . , M and / = 1, . . . , A^; we use [A]k^i to denote the element in the A;th 
row and the Ith column of A, and we define A+ such that [A+]fci = max{[A]fc i,0} holds. 
Expectation is denoted by E{-}, log(-) is the logarithm to the base 2, and In(-) is the natural 
logarithm. The imaginary unit is represented by j. 

II. System Model 

We consider a MIMO channel which is characterized by time-varying and frequency-flat 
fading. The input-output relation for transmission from A^tx antennas at the TX to A^rx antennas 
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at the RX is given at time slots m G Z by the received length- A^rx column vector 

y[m] = H[m]x[m] + n[m]. (1) 

The random channel matrices {H[m]}, each of size A^rxX A'tx, are jointly proper. The length- A'tx 
column vectors {x[m]} denote the zero-mean jointly proper Gaussian transmitted vectors that 
are uncorrected in time with spatial covariance matrix E |x[m]x^[m]} = P^Qlm], > 0, and 
tr {Q[m]} = 1. The length- A'rx column vectors {n[m]} are the white jointly proper Gaussian 
noise vectors in time with spatial covariance matrix E|n[m]n^[m]} = ct'^Inrx > 0- 

The random processes {H[m]}, {n[m]}, and {x[m]} are assumed to be mutually independent. 
For ease of exposition, we define the (nominal) SNR p = Px/o'n- We assume the RX to have 
instantaneous CSI, i.e., the RX has knowledge of the current channel realization H[m]. The TX, 
on the other hand, only has statistical CSI of the channel. 

in. Channel Modeling and Decomposition 

A channel model has to be accurate yet simple enough to offer insight on the influence of 
the relevant channel parameters on the system performance. Several approaches to model the 
channel exist; they can be mainly classified in physical and analytical models [14J. We choose 
the popular correlation-based analytical modeling approach for MEMO channels which is easier 
to use for analytical evaluations and which requires statistical parameters that are, in general, 
readily available from measurement data. Correlation-based analytical models can contain a 
term representing line-of-sight (LOS) or a strong scatterer [15J for each MIMO sub-link. The 
amplitude of the sub-links then changes from a Rayleigh to a Ricean distributed random variable. 
The ratio between the power of the dominant component and the power of the remaining weaker 
component is referred to as the iC-factor. 

A. Channel Model 

It is common to represent the dominant components of the MIMO channel by a deterministic 
rank-one matrix [[T6l . [fTTl . While this is usually applicable for an SP MIMO system in an LOS 
scenario where the TX and the RX are fixed, it is not appropriate in general. This is especially 
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true for DP MIMO systems where independent propagation along orthogonal polarizations might 
occur. Moreover, in the presence of a mobile terminal (MT), the dominant channel component, 
i.e., a strongscatterer or LOS, has a varying phase and as a consequence the mean of the channel 
is zero llTSl J^I We thus introduce the following model for SP and DP mobile MEMO channels: 



U\m] 



Hvv["^] HHv["^] 



+ 



HvvH HhvH 
HvhM HhhH 



(2) 



=HH 



=H[ml 



where H[m] contains the dominant contributions, which are due to LOS or strong scatterers, 
and H[m] contains the remaining contributions of the channel. The A^Tx,a x ^Rx,b sub-matrices 



(3) 



and Hab[m] contain the sub-links with polarization a at the TX and b at the RX for a,b E {V, H}. 
Here, V and H denote vertical and horizontal polarizations, respectively!^ The number of vertical- 
polarized (VP) and the number of horizontal-polarized (HP) antennas at the TX are given by 
A^Tx,v and A^tx.h. respectively. We thus have A^tx,v + ^tx,h = ^tx- The relations at the RX side 
are obtained analogously. In the SP case, we either use only VP or only HP antennas. In the DP 
case, we assume that, at both the TX and the RX, one half of the antennas is VP while the other 
half is HP. We split the dominant contributions into the deterministic amplitude matrix Vafe[m] 
and the random phase matrix ^ablfn] with [$afe["^]]fc,i = e-'''^'^'''('-i)^Rx+*['"l for A; = 1, . . . , A^rx,;, 
and / = !,..., A^xx.a- The remaining weaker scatterers are represented by the zero-mean proper 
Gaussian matrix H[m], i.e., Hab[m] for a,b E {V, H}. As highlighted in 0, the challenging 
part is the modeling of the dependence between the phases of the dominant components (f)ab,p[fn] 



'Another reason for a zero-mean channel can be the consideration of channel samples at other frequencies as different channel 
realizations. 

^We note that other polarization choices, e.g., corresponding to a slanted scheme, are possible as well; however, we choose 
vertical and horizontal polarizations as they often have different propagation characteristics, see [12] for an example in an indoor 
scenario. 
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for p = 1, . . . , Njx,aN-Rx,b- We first consider all MIMO sub-links with polarization a at the TX 
and b at the RX. For p,q = 1, . . . , A^Tx,aA^Rx,6, we assume 

1) 'Pab,p['m] is independent of H[m], 

2) (pab,p['nT] is uniformly distributed over [—71,11), 

3) = 0a6,pH - 0afe,g["^] is deterministic. 

The first two assumptions are commonly used, see, e.g., flSj. However, a note is in order regarding 
the last assumption. As mentioned above, the contributions from the dominant components are 
not deterministic, e.g., due to the mobility of the MT. For the case that all MIMO sub-links of 
the same polarization combination a and b observe the same dominant component and that the 
distances between the TX, the RX, and a possible dominant scatterer are considerably larger 
than the array sizes, the resulting phase changes are equal for all of these sub-links. Therefore, 
Z\^'^^[m] is modeled as constant inside a region of constant statistical channel parameters, i.e., 
Z\^'^j,[m] is deterministic. Clearly, assumption 3) is not satisfied for all antenna setups, e.g., it 
would not necessarily hold for a MIMO system made of directional antennas with different 
orientations. Therefore, for each polarization, we require the (directional) antennas at the TX 
and the RX to be oriented in the same direction. Using assumption 3), we can rewrite ^ as 

H,4m] = Yab[m] A^,,4m] e''^'^'^"'\ a, b G {V, H} (4) 

where we defined (f)ab['m] = (f)ab,i['m] and the deterministic matrix A^^abim] = ^abim] e'^'^"^^^"^^ 

B. Channel Correlation 

Subsequently, we define full and transmit correlation matrices of the channel. Furthermore, we 
characterize the structure of the correlation matrices of the dominant components of the channel. 
The results will be needed for the channel decomposition in Section IIII-CI and the performance 
assessment in Section |IVl 

1) Full Channel Correlation Matrices: We first define the length- A'tx^rx column vectors 
h[m] = vec {H[m]}, h[m] = vec {H[m]}, and h[m] = vec{H[m]}. The corresponding A^tx^rxX 
A^Tx^Rx full correlation matrices of the channel are then obtained as 

R[m] = E|h[m]h^[m]j; R[m] = E |h[m]h^[m]|; R[m] = E |ii[m]h^[m] | (5) 
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respectively. Using assumption 1) in Section ITlI-Ai it immediately follows that 

R[m] = R[m] + R[m] (6) 

holds. We can categorize the MIMO sub-links into co-polarized sub-links, i.e., links with VP to 
VP or HP to HP transmission, and into cross -polarized sub-links, i.e., links with VP to HP or HP 
to VP transmission. Depending on whether the four polarizations combinations share a dominant 
component or not, the rank of R[m] can vary. We show in Appendix lAl that generally we have 
rank {R[m]} < 4. Since the cross -polarized sub-links are hardly affected by, e.g., the occurrence 
of LOS, we consider the practically relevant setting that only the co-polarized sub-links can be 
affected by dominant components. Then, it can be similarly shown that rank |R[m]} < 2 has 
to be satisfied. Further specializing this setting to the case that the VP to VP and the HP to 
HP sub-links are affected by distinct dominant components with independent phase terms, it 
follows that rank |R[m]} = 2 is satisfied. When all polarization combinations share a common 
dominant component, we have rank {R[m]} = 1. For an SP setup, rank {R[m]} < 1 holds. 
2) Transmit Channel Correlation Matrices: The A'tx x A^tx TX correlation matrices are 

RtxH = E|H^[m]H*[m]|; RtxM = E |H^[m]H*[m] |; RtxH = E |H^[m]H*[m]|. 

(7) 

With assumption 1) in Section Ull-Al we have 

Rtx [m] = Rtx H + Rtx H- (8) 
We are interested in the structure, or more specifically the rank, of Rtx["^]- To that end, 
we assume that V^^m] = VRx,ab['m]vjx,ab[^] ^nd A^^ab[m] = djix,ab[m]d'^x,ab[^] with the 
deterministic length- A^rx column vectors VRx,afe["^] and dRx,afe["^], and the deterministic length- 
Atx column vectors VTx,ab["^] and dTx.abi?^] holdj^ In Appendix |Bl we show that generally 
rank { Rtx ["^]} < 4 holds. In the case that only the co-polarized sub-links are affected by 
dominant components, we obtain rank { Rtx i"^]} = 2. Finally, for an SP setup, we have 
rank { Rtx ['Ti]} = 1. 

^Note that this decomposition only imposes a rank-one condition for each polarization combination, which is reahstic when 
the distances between the TX, the RX, and possible dominant scatterers are large. 
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C. Channel Decomposition 

We now describe a simple method to separate the contributions of the dominant channel 
components and the remaining weaker scatterers from the channel correlation matrix. We thus 
aim at splitting R[m] into R[m] and R[m]. We note that in the mobile setting we cannot use 
the mean of the channel to decompose the channel into the dominant and the remaining channel 
components. We thus introduce a method to decompose the channel that is simple compared to 
high resolution parameter estimation techniques [19|. The method is inspired by the well-known 
i^-factor estimation in [EQI. It is suitable for both SP and DP MIMO channels. 

We use the second- and fourth-order moments of the channel R[m] = E |h[m]h^[m]} and 
T[m] = E {(h[m]h^[m])^}, respectively, to obtain a simple solution to the channel decompo- 
sition of R[m] = R[m] + R[m] into R[m] and R[m]. From Appendix O we have the relation 
T[m] = R[m] tr {R[m]} + R^[m] — R^[m] which can be reformulated as 

B?[m\ = R[m] tr {R[m]} + R^[m] - T[m]. (9) 

With the eigendecomposition R[m] = U[m] A[m]U^[m], we can thus directly obtain the unitary 
eigenvector matrix U[m] and the diagonal eigenvalue matrix A[m] of R[m]. 

1 ) Dual-Polarized Channel: According to Section IIII-Bll at most four eigenvalues of R[m] 
are non-zero; however, only two can be highly significant and smaller eigenvalues tend to be 
estimated less accurately. We thus have to exercise care in choosing the number of considered 
eigenvalues A^dp- Subsequently, we first find an estimate of R[m] denoted as R[m] according 
to (|9l). We then extract the Adp largest eigenvalues of R[m]; this step is akin to taking the best 
rank-Acp approximation of R[m] in terms of the matrix 2-norm [21, Th. 2.5.3]. Clearly, we 
have Ad? < 4. The final estimate of R[m] is 

A^DP 

R(^)[m] = ^Ck[m]uk[m]u^[m] (10) 
fc=i 

where the vector iik[m] denotes the eigenvector corresponding to the A;th largest eigenvalue 
Xk[fTi] of R[m] for k = 1,...,Adp- We now define the (positive semidefinite) estimates of 
R[m] and R[m] as R'^^)[m] and R'^^)[m], respectively. Moreover, we define R;[w^] = R'''^''[m] — 
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Ylk=i CA;["^]ufc[m]u|^[m] for / = 0, . . . , A^dp- The parameters Ck[m] for A; = 1, ... , A^dp are chosen 
such that R^"^) [m] = H^^^ [m] — R*^^^ [m] is positive semidefinite, see Appendix |Dl 



Cfc[m] = < 



0, for singular Rfc_i[m] 

min < A^[m], f iif [m]R;.\[m]ufc[m] j > , else. 



Note that some power of the dominant components corresponding to Uk [m] is transferred from 
R*^'^^[m] to R('^)[m] whenever Ck[m] < \k[m\. This might occur when the estimates of the 
moments R[m] and T[m] are inaccurate. 

2) Single-Polarized Channel: From Section ITlI-Al we know that R[m] can at most have rank 
one. We thus obtain the following estimate of R[m]: 

r(^) [m] = ci [m] ui [m] uf [m] . (1 2) 

The constant ci [m] is chosen as in (fTTI) to ensure the positive semidefiniteness of R*^^^ [m]. We can 
generate SP channel realizations H*-^^ [m] based on the statistical channel parameters according to 

vec {H(f)[m]} = ^/ci[m] Ui[m]e^'^ + (^R^^^^ [m] ) ' g (13) 

where </) is uniformly distributed over [— vr, tt), and g is a zero-mean proper Gaussian random col- 
umn vector of length A'tx^rx with covariance matrix 1^^^^^^^; and g are mutually independent. 

IV. Performance Assessment 

With respect to the system model in Section HIl the MI between the input x[m] and the output 
y[m] combined with instantaneous CSI at the receiver is given in bit/channel use (bit/c.u.) by 

/ (x[m]; y[m ],H[m]) = Ejlogdet {1n^ + pH[m]Q[m]H"[m])} 

= E {logdet (livxx + pH^[m]H[m]Q[m]) } (14) 



where, in (a), we used [22, Th. 1.3.20]. Note that the MI in (1141) is time-dependent as the 
channel is in general non- stationary; therefore, in a strict sense, (fT4l) is not an achievable rate. 
Nevertheless, we use the MI (fT4)) as performance measure since it has an interpretation in terms 
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of an achievable rate in bit/channel use (bit/c.u.) for non- stationary slow- and fast-fading wireless 
channels [|23l, [|24ll. 

With Appendix |El we can state the following second-order approximation of (fT4l) : 

/ (x[m]; y[m], H[m]) 
^/(^) (p,Q[m],RTxN,Z[m]) 
= logdet + pR*^[m]Q[m]) - i^^^ 

X tr |z[m] ((Q[m] (I^,, + pIV,^[m]Q[m])-'f ® (Q[m] (I^,, + pR^^NQH)'')) } (15) 

with the iV|x ^ -^tx fourth-order moment matrix of the channel 



Z[m] = E <j vec {H^[m]H[m] - R^xM} (vec {H^[m]H[m] - R*^[m]})" \ . (16) 



Additionally to RtxI"^]^ (|T5I) requires the evaluation of the fourth-order moment of the channel 
Z[m]. In order to gain insight on the influence of typical statistical channel parameters on the MI, 
we rewrite Z[m] as a function of R[m] and R[m] only. Both of these parameters are available 
with the channel decomposition in Section IIII-C[ In order to restate ([T6l) for SP as well as 
for DP channels, we assume that only the co-polarized sub-links can be affected by dominant 
components. In Appendix |Fl we then obtain the following result: 

vec {Z[m]} = (Iat^.^ (g) Y[m]) vec {R[m]} 

+ (Kn^^^Ntx ® ^Ntx,Ntx) (W ® Y* [m]) vec {R*[m] } (17) 

with the A^lx ^ -^tx^rx block matrix Y [m] containing I tvtx ® '^k,i [itt] in the kth row-partition and 
the Ith column-partition for = 1, . . . , A^tx and 1 = 1,..., Arx- The Atx x N^x matrix 'Xk,i[m] is 
defined by Xfc^;[m] = R[m] for p = 1, . . . , Atx and q = 1, . . . , Arx- 

L ' Jp,q i (k-l)NKx+l,(j>-l)NRx+q 

Note that R[m] = R[m] + R[m] holds. 



A. SP vs. DP Performance: High-K-Factor Case 

We now compare the performance of SP and DP setups in the high-i^-factor regime. First, 
consider the case of an asymptotic iC-factor setting, i.e., infinitely large i^'-factors, and that only 
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the co-polarized sub-links have dominant components. Then, the Jensen bound on the MI given by 

/(^) (p, Q [m] , Rtx [m] ) = log det (I^,^ + pR^x [m] Q [m] ) (18) 

and corresponding to the first term in ([15} is equal to the MI ([141) : it can thus be used for a simple 
analytical performance evaluation. Note that the channel influences the Jensen bound on the MI, 
i.e., ([TSl) . only through Rtx["^]- In the asymptotic i^-factor setting, we have Rtx["^] = Rtx["^]- 
Using Hadamard's inequality [[22l Sec. 7.8.1], it can be shown that ([T8l) is maximized by 
choosing the eigenvectors of the input co variance matrix Q[m] to be given by the eigenvec- 
tors of Rjxf'Ti]. I.e., for the eigendecomposition Rjx["^] = Utx["^] Axxf^^jU-^fm] with the 
unitary eigenvector matrix Utx["^] and the diagonal eigenvalue matrix Axxi'ri] of R^xi"^]^ 
we obtain Q[m] = Utx["^] AQ[m]U^[m]. Here, AQ[m] is the diagonal eigenvalue matrix of 
Q[m] determining the power allocation. Furthermore, we define ATx,fc["^] = [ATx["^]]fc,A: and 
AQ,fcH = [KQ[m]]k,k for /c = 1, . . . , A^tx, where Atx.^H > ATx,fc+iH for A; = 1, ... , A^tx - 1 
holds. 

The crossing points between the MI of an SP setup and the MI of a DP setup are then given by 

/^'^ (p, QTx,sp[m], Rtx.spN) = (p, Qtx,dpM, Rtx,dpH) (19) 
J]^ (1 + pATx,sp,fcHAQ,sp,fcH) I = log I J]^ (1 + pATx,DP,fcHAQ,DP,fcH) I (20) 

where Atx,sp,a,["^] and ATx,DP,fc["^] for k = l,...,Njx are the eigenvalues of the SP and DP 
transmit correlation matrices Rtx,sp["^] and Rtx,dp["^], respectively. Similarly, AQ,sp,fc["^] and 
AQ,DP,fc["^] for k = l,...,Njx are the eigenvalues of the SP and the DP input covariance 
matrices Qtx,sp["^] and QTx,Dp['Ti], respectively. As highlighted in Section IIII-B2[ we have 
rank {Rtx["^]} = 2 if only the co-polarized sub-links have dominant components and we have 
rank I Rtx ["^]} = 1 in the SP case with a dominant component. In the high-K-factor regime 
with dominant components for co-polarized propagation only, we thus have to decide between 
an SP setup with one transmitted stream and a DP setup with two transmitted streams. 

To obtain the crossing points when Atx.sp.iI"^] > 0, Atx.dp.A:!"^] > 0, and Aq^dp,*,!"^] > for 
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/c = 1, 2, we simplify to 

log (1 + pAtx,SP,iM) = log (^Yl + P-^TX,DP,fcHAQ,DP,fc[m])^ (21) 

2 

1 + pAtx,sp,iH = n (1 + pATx,DP,fcHAQ,DP,fcH) • (22) 



k=l 



Besides the crossing point at p = 0, there is a crossing point at 

(J)|-_l _ Atx,SP,i['^] ~ Asum,Dp["^. 



p'c^[m] = ' ' (23) 

Aprod,DP [^^J 

which is positive if Atx,sp,i["^] > Asum,Dp["^]- Here, we defined 

Asum.DpN = Atx,dp,iN-^Q,dp,iM + Atx,DP,2 M Aq,dp,2 H (24) 

Aprod,DpH = ATX,DP,lMAQ,DP,lMATX,DP,2[?Ti]AQ,DP,2H. (25) 



By inspecting (|22|) . we observe that the contribution of the MI of the SP setup, i.e., the left hand 
side of (I22I) . is a linear function of the SNR p, while the contribution of the MI of the DP setup, 
i.e., the right hand side of (|22l) . grows quadratically with the SNR p. We thus conclude that the 
DP setup outperforms the SP setup only at SNR values above Pcp["^] if Atx,sp,iM > Asum,Dp['Ti] 
holds. Otherwise, the DP setup always outperforms the SP setup. 

B. SP vs. DP Performance: General Case 

In this section, we study the performance of the SP and the DP setup in the general case of 
arbitrary i^'-factors. Now, we need to consider the approximate evaluation of the MI (fT5l) and 
cannot restrict to the Jensen bound on the MI. Similarly to Section IIV-A[ we consider the case 
of the SP setup transmitting a single stream and the DP setup transmitting two streams with 
positive Atx,sp,i["^], Mx,jiv,k[m\, and AQ,DP,fc["^] for A; = 1,2. Furthermore, we again choose the 
eigenvectors of R^^i"^] eigenvectors of the input covariance matrix Q[m]. In order to get 

a closed-form expression of the crossing points, we derive a lower bound on the approximate 
MI (fISl) in Appendix O It is given by 

/(L^) (p, Q[m], RtxH, Z[m]) = logdet (I^Vtx + pR^xMQH) - ^og{e)w[m] (26) 
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with 

Mm] = 2^ 2^ r-TT r-] (27) 

and the number of transmitted streams N^i. We note that this lower bound is tight in the limit 
p — 7- 00. Based on (|26|) . we calculate the crossing points of the MI of the SP setup and the MI 
of the DP setup by considering 

I^"^""^ (p, Qtx,sp H , Rtx,sp H , Zsp [m] ) = /(^b) (p, Q^x,dp H , Rtx,dp H , Zdp [m] ) (28) 

where Zsp[m] and ZDp[m] denote the matrix Z[m] for the SP and the DP case, respectively. 
Similar to Section |IV-A[ we note the linear and the quadratic growth with the SNR p of 
the exponentiation (with respect to the base 2) of the MI (|26l ) for the SP and the DP setup, 
respectively. We then obtain a crossing point above which the DP setup outperforms the SP 
setup at 



^ 2Aprod,Dp["^] V V 2Aprod,Dp["^] / Aprod,Dp["^] 

(29) 

if 4(1 — a[m])Aprod,Dp["^] < (Atx,sp,iMq^["^] ~ Asum,Dp["^])^ is satisfied. Otherwise, the DP setup 
always outperforms the SP setup. Here, we defined a[m] = exp(wDp["^] — ^'^'sp[^]), which is a 
correction factor, and Wsp[m] and u'Dp[m] are obtained from (|T7T ) for the SP and the DP case, 
respectively. When a;[m] = 1, we recover the solution (|23] ). 

V. Channel Measurements 

We evaluate the previously obtained results using urban macrocell channel measurements 
that were performed at 2.53 GHz in two bands of 45 MHz in Ilmenau, Germany. During the 
measurement campaign, the DP MIMO channel from three base station (BS) positions with 
different heights to a multitude of MT tracks was measured sequentially. The MT was moving 
with a maximal velocity of about 10 km/h. In this paper, we extract the 20 MHz band centered 
at 2.505 GHz, and we use the three BS positions at a height of 25 m with the three MT reference 
tracks. For further details regarding the measurement campaign, see [|25l . [|26l . 
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After denoising the channel measurements in the time-delay domain, we normalize the channel 
matrices H[m]. The normalization is performed with a scalar factor such that E {| |hco[m] 1 = 
A^co is emulated inside each stationarity region containing A^^ = 16 samples in time and Nj = 
128 samples in frequency. Here, hco[m] is a vector containing only the N^o elements of H[m] 
corresponding to co-polarized sub-links. This guarantees a fair comparison between SP and DP 
setups since we account for the power loss in cross -polarized sub-links. Then, we estimate the 
statistical quantities by replacing the ensemble averaging with an averaging over Nt time and 
Nf frequency samples. This yields a total of 2048 (~ 500 non-coherent) realizations [|26l . 

A. Antenna Setups 

We choose a uniform linear array at the BS and two uniform circular arrays (UCAs), which 
lie on top of each other, at the MT for the subsequent evaluations. The antenna arrays consist of 
patch antennas that can be excited vertically and horizontally. Due to the UCAs at the MT, we 
are able to differentiate between the following four orientations: the front (direction of motion), 
the back, and the two sides of the MT. For our evaluations, the BS and the MT act as the TX and 
the RX, respectively. We consider two SP antenna setups, a VP and an HP setup, as well as two 
DP antenna setups, a co-located (DP-CL) and a spatially separated (DP-SS) setup, for the 4x4 
MIMO case. For the SP setups, the antennas are separated by Ac at the TX and 0.5Ac (different 
UCAs) or 0.327Ac (same UCA) at the RX. For the co-located DP-CL setup, the antenna patches 
at the TX and the RX are separated by 3Ac and 0.5Ac (across the UCAs), respectively. For the 
spatially separated DP-SS setup, we use the same antenna patches as in the SP case. However, 
we have a separation of 2Ac between antennas of the same polarization at the TX side. At the 
RX side, the lower UCA is only used for the VP excitation while the upper UCA is only used 
for the HP excitation. We note that all setups result in the same array length at the TX. 

B. Scenario Classification 

Based on the measurements, for the SP case, we mainly observe links with either low K- 
factors and low correlations between the MIMO sub-links or links with high i^'-factors and high 



16 



TABLE I 

Specification and Properties of the Reference Links 



Link 


BS 


Track 


MT orientation 


MT position [m] 


J-f -Factors 


1 


1 


41a-42 


back 


- 34.9 


low 


2 


3 


9a-9b 


left 


- 38.9 


medium 


3 


2 


10b-9a 


front 


9.8-56.8 


high 


4 


3 


10b-9a 


left 


- 64.9 


varying 



correlations. A similar observation was made in (Tl] and [|28ll . Thus, similar to fl^, we classify 
the measurements into links with low, medium, and high (co-polarized) i^'-factors, see Table H 
The low i^'-factor links are characterized by i^'-factor values in [0,2], while the medium and 
high K-factors links have several peaks with values above 5 and 10, respectively. Additionally, 
we have one link with varying fT-factors which consists of low and high iiT-factor parts. The 
reason for the low iT-factors/correlations in link 1 and 2 is that track 41a-42 is partly located in 
a street canyon; regarding BS 1 and 3 no dominant components are expected. In contrast, tracks 
9a- 9b and 10b-9a are mostly situated in an open environment where dominant components are 
more likely to occur. 

VI. Results 

In order to check the efficiency of the channel decomposition, we compare the f^-factors from 
the decomposition to the ones obtained from the measurements with the moment method in [20]. 
The results on the i^-factors are averaged over the sub-links of each polarization combination for 
the DP-CL setup. Subsequently, we consider the practically relevant case of extracting A^dp = 2 
eigenvalues, see Section |III-C1[ In Table HH we show the results for links 1-3 averaged over 
the driven distance. We see that the cross-polarized sub-links, VP to HP (V-H) and HP to VP 
(H-V), show significantly smaller i^-factors than the co-polarized ones, VP to VP (V-V) and HP 
to HP (H-H). In general, we observe lower i^'-factor values from the channel decomposition; 
this is due to guaranteeing the positive semidefiniteness of the correlation matrices, which can 
result in a shift of the power from the dominant components to the remaining components of the 
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TABLE II 

Average if -Factors From the Measured Channel and the Proposed Channel Decomposition 





A'-factors: Measurements 


Jf-factors: Decomposition 


Link 


V-V 


H-H 


V-H 


H-V 


V-V 


H-H 


V-H H-V 


1 


0.5 


0.8 


0.4 


0.4 


0.5 


0.6 


0.3 0.3 


2 


1.6 


1.4 


0.6 


0.7 


1.2 


0.9 


0.2 0.2 


3 


4.0 


5.7 


1.9 


1.8 


4.0 


5.4 


1.7 1.5 



channel, see Section IIII-C[ In Fig. [H we depict the evolution over distance for link 4 since it is 
characterized by varying i^-factors, see Table HI Similar obervations as in Table HI] can be made. 
Furthermore, we observe that the channel decomposition is able to reproduce the tendencies in 
the evolution of the measured i^-factors. 

Next, we evaluate the performance of the SP and the DP setups. In order to compare the 
approximate evaluation of the MI, i.e., (fT5l) with (flTl) . to the (exact) MI (fT4l) . we use A^^dp = 2. 
We use the optimal input with respect to the Jensen bound on the MI, where the eigenvectors 
of Ryxl'Ti] form the precoding and the power allocation is obtained by a simple water- filling 
strategy [|29l , unless otherwise specified. The results of links 1-3 are accumulated over each track 
and shown as a function of the SNR in Fig. [2l We observe that only at high SNRs there is a 
noticeable gap between the MI and its approximate evaluation. The DP-CL setup only provides 
an advantage in terms of the MI compared to the SP setups, i.e., the VP and the HP setup, if 
the i^-factors (of the co-polarized sub-links) and the SNR attain certain values; the higher the 
/T-factors, the lower this SNR threshold is. Practically, a switching between SP and DP setups 
is thus most useful in medium- to high-i^'-factor scenarios; there the crossing points between 
the MI of the SP setups and the DP-CL setup are accurately reproduced by the approximate 
evaluation of the MI, i.e., (fT5l) with (flTI) . Furthermore, in Fig. [3l we plot the MI over distance for 
the VP, the HP, and the DP-CL setup on link 4 at an SNR of 10 dB. We observe that the positions 
at which the DP-CL setup outperforms the SP setups coincide with high K-factors, see Fig. [U 

We now compare the performance using two different DP setups, the DP-CL setup with co- 
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Fig. 1. ii'-factors vs. distance on link 4 (averaged over sub-links of the DP-CL setup with the same polarization combination). 



located antennas and the DP-SS setup with spatially separated antennas. In Fig.lH we show the 
MI of the DP-CL and the DP-SS setup, exemplarily, on link 2. We observe that the DP-SS setup 
is able to reach even higher MI values at high SNR. We expect that this is due to the increased 
viewing angle into the propagation channel for each polarization at the RX side, which results 
in an increase in the degrees of freedom. The DP-CL setup, however, offers a more compact 
antenna array at the cost of a reduced viewing angle at the RX. Furthermore, we observe here 
that the approximate evaluation of the MI is more accurate for the DP-SS setup than it is for 
the DP-CL setup. 

The average SNR values above which the MI of the DP-CL setup with two streams and equal 
power allocation is higher than the MI of the VP or the HP setup with a single stream are given 
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(a) Low-A'-factor link 1 
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Fig. 2. MI vs. SNR of the exact and the approximate evaluation for the VP, the HP, and the DP-CL setup. 



Table Hill Note that the preceding is again given by the eigenvectors of R^xi'Ti]. The resulting 
crossing points are calculated using the various methods introduced before, i.e., using the MI 
and the approximations given in (fT4l) . (fTSl) . (fTSl) . and (1261 ) together with (fTTl) . We observe that the 
approximate evaluation of the MI (fTSl ) is able to accurately reproduce the average SNR values. 
When using the Jensen bound on the MI, we obtain lower average SNR values. Note that the 
Jensen bound on the MI is only useful for high-K-factor links; thus, we only give the results 
for link 3. The SNR values obtained from the lower bound on the approximate MI, i.e., (|26l) . 
yield a slight overestimation of the average SNR values for all links. We observe that all the 
(exact) crossing points are roughly between 5 and 7 dB. A clear dependence on the link is not 
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Fig. 3. MI vs. distance for tlie exact and tlie approximate evaluation for tfie VP, the HP, and tfie DP-CL setup on link 4 with 
an SNR p = 10 dB (the blue-shaded regions denote positions where the (co-polarized) isT-factors are high, cf. Fig. [T). 
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Fig. 4. MI vs. SNR for the DP-CL and the DP-SS setup on link 2. 



present; this is due to the restriction to two and one transmitted stream for DP and SP MIMO 
systems, respectively. 

VII. Conclusion 

In this paper, we have studied the modeling of DP MIMO channels as well as the performance 
over such channels. We proposed a general model for DP mobile Ricean channels with a 
channel decomposition technique yielding necessary statistical channel parameters. Furthermore, 
we derived an approximation of the MI, which is a function of those parameters, in order to 
gain some understanding on the statistical channel parameters influencing the MI. Based on 
the approximate evaluation of the MI, we were able to analytically characterize the required 
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TABLE III 

Average SNR Values Above Which the MI of the DP-CL Setup with Two Streams and Equal Power 
Allocation is Higher than the MI of an SP Setup with a Single Stream 





SNR Values pcp [dB] (averaged) 




VP vs. DP-CL 


HP vs. DP-CL 


Method 


Link 1 


Link 2 


Link 3 


Link 1 Link 2 


Link 3 


Exact: l[l4j 


4.998 


6.759 


5.242 


5.559 7.197 


5.744 


Appr.: ([15) 


5.073 


7.027 


5.240 


5.567 7.324 


5.680 








4.623 




4.813 




6.154 


7.722 


5.747 


6.564 7.976 


6.130 



SNR for a dual-stream DP MIMO system to outperform a single-stream SP MIMO system. 
Finally, we applied the obtained results to channel measurements performed in an urban macrocell 
environment at 2.53 GHz. We find that for sufficiently high _ft"-factors DP MIMO systems are 
able to outperform SP MIMO systems if a certain, practically relevant, SNR is attained. 



Appendix A 
Rank of R[m] 

We are interested in a condition on the rank of R[m] for the DP case. We first drop the time 
argument for notational simplicity. Then, we rearrange R through column and row permutations 
with the permutation matrix P into R'^p) = PRP^ such that 

Rvvvv RyvvH RyVHY RvVHH 



R(P) 



RvHVV R-VHVH R-VHHV R-VHHH 



Rhvvv Rhvvh R-hvhv Rhvhh 



R-HHYV R-HHVH R-HHHV R-HHHH 



(30) 
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with Habcd = E jvec {Haf,} (vec {Hcd})^| for a, b,c,d e {V, H} holds. We now have 
r(p) (J (yec {V} (vec {V})^) E jvec {$} (vec {^}f} 

(vec {V} (vec {V})^) (vec {A^ (vec {A^)^) (g 
where, in (a), we used (|3]) and defined 



vec {V} = (vec {Vw})^ (vec {Vvh})^ (vec {Vhv})^ (vec {Yhh})"^ 



vec {$} = (vec {$vv})^ (vec {$vh})^ (vec {$hv})^ (vec {$hh})^ 
and, in (b), we used (HI) and defined 



1 T 



vec {A,^} = (vec {A^^vv})^ (vec { A^^vh})^ (vec { A^^hv})' (vec { A<^,hh})' 

1 5'VVVH fi'VVHV fi'VVHH 

5'VHVV 1 S'VHHV fl'VHHH 

fi'HVVV S'HVVH 1 fi'HVHH 

.fl'HHVV 5'HHVH 5'HHHV 1 

with gabcd = E {e-'*^'^''''"'^'^'*)} for a,b,c,d G {V, H} and the all-one matrix Ia? of size N x N. 
As the rank of a matrix is unchanged by left or right multiplication with a non-singular matrix 
[|22l Sec. 0.4.6 (b)], it is obvious that 



1 T 



(31) 

(32) 
(33) 

(34) 



G 



(35) 



rank {R} = rank {R^p^} (36) 
rank{G}<4 (37) 

hold. Moreover, we have rank{A0B} < rank {A} rank {B} [30, Th. 5.1.7] as well as 
rank{A0B} = rank {A} rank {B} |[30i Th. 4.2.15] for matrices A and B of appropriate 
sizes. With (|3T1) . (|36l) . and (1371) . we then immediately obtain the inequality 

rank{R}<4. (38) 

Appendix B 
Rank of Kjxlm] 

We first drop the time argument to simplify notation. In order to evaluate the rank of Rtx, we 
use Vab = VRx,ab^Tx.ab ^nd A^^ab = dRx,afed^x ab- ^ascd on we decompose the dominant 
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channel component for each polarization combination a,b E {V,H} as 

Haft = (vRx.af, dRx,a6) {^TXM © d^x^abf e^'^""- (39) 

We then obtain for a, h,c,d E {V, H} 

E {HjftH^J = (vTX,a6 dTX,a6) (vTX,cd © dxx.cd)^ /abed (40) 

with fabcd = (vRx,afe © dRx.afe)^ {^Rx,cd © dRx,cd)* E { e^(<^-''-'^^'*) } . With ©, we can write 



Rxx — E < 



TtT XT* _|_ TTT |LT* T-JT U* I xtT tlt 



TtT XT* _|_ XjT |LT* T-jT TJ* I xtT |LT* 

-"-HV-n-vv "T -n-HH-n-VH' -n-HV-n-HV "T -n-HH-n-HH 



(41) 



Using (gTj) with (1401), we obtain 



R 



TX 





tvv/vvvv 




tvv/vVHV 


^HV 


+ 




tvH/vHVH 




tvn/vHHH 






tHv/nvvv 




tHv/HVHV 








tHH/HHVH 









(42) 



with tab = VTx,afe © dxx.afc for o, 6 G {V, H}. For matrices A and B of appropriate sizes, we 
have rank {A + B} < rank {A} + rank{B} JSOl Sec. 0.4.5 (d)]. Thus, we conclude that 

rank{RTx}<4 (43) 

must hold. If only the co-polarized sub-links have dominant components, rankjRTx} = 2 is 
obtained using (|42l) . For an SP setup with a dominant component, we have rank {Rtx} = 1- 

Appendix C 

Evaluation of the Fourth-Order Moment T[m] 

We now evaluate the fourth-order moment T[m] = E | (h[m]h^[m])^|, where we drop the 
time argument for notational simplicity: 

T = E I (hh^ + hh^ + hh^ + hh^)^ 

= E jhh^hh^l + E jhh^hh^l + RR + RR + E jhtr |R}h^| + E jhte 
= Rtr |r| + R2 + Rtr |r| + RR + RR + Rtr |r| + Rtr |r 



Rtr{R} + R2 -R2 



(44) 
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In (a), we used E{h} = Oat^xA^rx,!' E{h} = O^^^NuxA^ mutual independency of h and h, 
and that E{hh^} = ONTxNRx,NjxNjix holds due to propemess of h. In (b), we made use of the 
fact that h^h = tr{R}, and we used [3T, Th. 1] which yields the following identity for the 
zero-mean proper Gaussian random vector h: 

E jhh^hh^l = E jiiii^l E jiiii^l + E jiiE jh^hj . (45) 

In (c), we used R = R + R. 

Appendix D 

Sufficient Condition for a Positive Semidefinite R('')[m] 

In order to derive a sufficient condition for the positive semidefiniteness of RA:[m],Vfc = 
1, . . . , A^DP and thus R*^^)[m], we need to solve the following inequality for Cfc[m], V/c = 1, . . . , A^dp 
for which we drop the time argument: 

(Rfc_i - Cfc Ufcuf ) z > 0, Vz e C^TxiVRxxi_ (46) 

The case z = Oat^xA^rx,! trivially satisfied. In case z 7^ O^^^Njui,!^ we first consider non-singular 
Rfc-i. We define z = R^^iZ and rearrange (|46] ) to obtain 

Ck z^Ufcuf z ^ Ck z^Rfc_\Ufcuf Rfc_\z ^ ^ 
z^R,_iz z^z 

The matrix R^^j^UfcUf R^^^ is positive semidefinite with rank one such that, with the Rayleigh- 
Ritz theorem [|22l Th. 4.2.2], we have 

z^^Ri. ^ UfcU^^R;. ^ z / - -i w - -i \ 

< ''^H^' < An^ax (R,_\u.uf R,_\) • (48) 



Finally, with (gTj) and (gSj), we obtain 

Ck < (Rfciiu.uf R,"A) = (ufR,-^Ufc) "\ V/c = 1, . . . , iVop (49) 

which is a necessary and sufficient condition for R^, V/c = 1, . . . , A^dp to be positive semidefinite 
if Ra;_i,VA; = 1, . . . , A^DP is non-singular. In the case of a singular Rfe-i, we set = 0. We 
thus obtain a sufficient condition for R*^^^ to be positive semidefinite. We note that (|49l ) (for 
non-singular Rfc_i, V/c = 1, . . . , A^dp) can also be derived based on [22, Th. 7.7.7]. 
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Appendix E 
Approximate Evaluation of the MI 

The approximate evaluation of the MI relies on a multivariate Taylor series expansion. We 
consider a complex function /(a, a*) with complex column vector arguments a and a* of lengths 
A^^. We note that a* is the complex conjugate of a. The second-order approximation of a and 
a* at ao and aj, respectively, is given by [l32l 



/(a, a*) ^ /(ao,ao) + 



dl 
da 



(a - ao) + 



df 



a=ao,a"'=ar. 



da* 



la an 



a=ao,a*=a| 







^ a - ao)^ ■ Hf (ao, a*) ■ (a - ao) + ^(a - ao)^ ■ Hf (ao, a*) ■ (a* - a*) 



+ ^(a - ao)^ ■ H-(ao, a^) ■ (a* - a*) + i(a - ao)^ ■ Hf (ao, a*) ■ (a - ao) (50) 



with the row vector df/da defined by [df/da]i^k = df/d[a\k,i for k = 1,...,N and the 
X A^^ Hessian matrices 



a (df_ 

da* 



Hf(ao,a*)= (H^^(ao,a*))' 



HT(ao,a*) 



d fdf 



da* V da 



T 



a=ao,a''=a^ 



a=ao,a*=a5 



Hj*(ao, ajij) 



9_ /9/ 

da \ da 



(51) 



a=ao,a*=a2 



We now consider the function /(a, a*) = /(a) = Indet A with a = vec {A} and the N x N 
matrix A. By using /(a) in (|50l ) with ao = E {a} and applying the expectation operator, we 
obtain the second-order approximation 

E {/(a)} ^ /(E {a}) + 1 tr {E { (a - E {a})(a - E {a})^} Hf (E {a}) } (52) 



where we used that only the first two and the last term in (pOl ) are non-zero. The Hessian matrix 
Hf (E {a} , E {a*}) = Hf (E {a}) is given by (Sl 



(E {a}) = -K^,^ ((E {A})-^ ® (E {A})-^) . 



(53) 
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For A = B + CDE with deterministic N x N matrices B, C, and E, (|52l ) can be written as 

E {/(a)} /(E {a}) + i tr { (E^ ® C) E {(vec {D - E {D}})(vec {D - E {D}})^} 
(E®C^) Hf(E{a})} 
/(E {a}) - ^ tr I (E^ C) E { (vec {D - E {D}})(vec {D - E {D}})'^} K 



X 

(b) 



N,N 



X (C^ E) ( (E { A})-^ ® (E { A})-^ 



/(E {a}) - i tr I E { (vec {D - E {D}})(vec {D^ - E {D^} })^} 
X (E (E {A})-' C)^ ® (E (E {A})-' C) }. (54) 

In (a), we applied vec {CDE} = (E^ (g) C) vec {D} [30, Lemma 4.3.1]. In (b), we inserted 
(153]) and used (E ® C^) Kn,n = ^n,n (C^ ® E) f34l, Th. 3.1 (viii)]. Finally, in (c), we used 
K^^^ = K^,^ llM Th. 3.1 (ii)] and (A ® B) (C ® D) = (AC) O (BD) ^ Lemma 4.2.10]. 

Appendix F 

Approximate Evaluation of the MI by Means of the Proposed Channel Model 



We restate (|T6l) as a function of the parameters obtained in the channel decomposition in 
Section UlI-Cl i.e., R[m] and R[m], only. To simplify notation, we drop the time argument for 
the remainder of this appendix. First, we rewrite dHJ: 



Z = E i vec |h^H + H^H + H^H + H^h) ( vec |h^H + H^H + H^H + H^H 



(b) 



H 



- vec {r^x + R^x} (vec {r^x + R^x}) 
E jvec {H^H} (vec {H^H})^| - vec {R^x} (vec {R^x})^ 

+ E|vec|H^H| (vec|H^H|)^| -vecjR^x} (vecjR^x})^ 

+ E|vec|H^H| (vec |h^h})^| +E jvec |h^h| (vec|H^H|)^| (55) 

with Rtx = E{H^H*} and Rtx = E{H'^H*}. In (a), we applied H = H + H and Rtx = 
Rtx + Rtx- In (b), we used the propemess of H to establish 

E (vec {h^h| (vec {h^h})''| = On^^,n^,- (56) 
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E jvec |h^h| (vec |h^h| 



We now have 

"} E { ® H^) vec {H} (vec {H})^ (i 
= E I (l^,^ ® H^) R (l^vxx ® h) } 
where, in (a), we used [|30l Lemma 4.3.1]. Similarly, we have 
E I vec |h^h| (vec |h^h|)^ 



H 



(57) 



(J 

(b) 



E <! ( ® I^Vtx ) Ktvtx 



vec {H} (vec {H})" K^^^^^tx ( H ® W 



KaTtx.A^tx ( E <{ ( Ia^tx 



) R ( I 



La^tx 



H 



K 



(58) 



where, in (a), we used [1341 Th. 3.1 (ii)], and, in (b), we used [1341 Th. 3.1 (viii)]. Next, we have 
E <! vec <! H^H !> f vec <! H^H ' ^ ^ 



(a) 



E I (In^^ ® H^) vec {h}} E <! ( vec <i H 



H 



H 



+ E I 



H^)R(I^,,®H 

H 



vec {r;^} (vec {r;^}) + E { 



) R ( I 



iNjx 



H 



(59) 



where, in (a), we used [I3T1 Th. 1] with the propemess of H. In order to evaluate (1571 ). (1581 ). and 
(|59l) . we use that 

vec |e I (^liVTx ® H^) A (^livxx ® h) } } = (w ® E ® I^,^ ® H^}) vec {A} 

= (I;vxx®Y)vec{A} (60) 

holds for a deterministic A^tx^rx x ^tx^rx matrix A. Here, the x A^xx^rx block matrix Y 
contains Itv^x ; in the kth row-partition and the /th column-partition for k = 1, . . . , Njx and 
/ = !,..., A^Rx- The A^tx x A'rx matrix X^j is defined by 







R 









(fc-l)AfRX+/,(p-l)AfRx+g 

for p = 1, . . . , A^Tx and q = 1, . . . , A^rx- For the DP case where only the co-polarized sub-links 
can be affected by dominant components, we can write H = Hi + H2 with 



Hi 



H 







vv 



JVgx Njx 
2 ' 2 



Oi 



Ojvrx JVtx 
2 ' 2 



Ho 



Oi 



liVRx Njx 

2 ' 2 



H 



HH 



(61) 
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Obviously, we have Hf H2 = Onj^^Njx and H^H = Hf Hi + H^Hs- Furthermore, with dD, 
we have R^x = E {Hf Hi + Hf H2} = H^H. It thus follows that 

E jvec {H^H} (vec {H^H})^| = vec {R^x} (vec {R^x})^ • (62) 

Clearly, the same result holds in the SP case. At last, using ([55]) with ([57]), ([58]), ^9^, and 
(|62l ). we obtain the result in (flTT ). 

Appendix G 

Lower Bound on the Approximate Evaluation of the MI 

In order to lower-bound the approximate MI ([T5] ), we find an upper bound for the trace in 
the second term of (fTSl ) for the case that the eigenvectors of R^x["^] form the precoding for 
the A'^^t transmitted streams. We drop the time argument in the following derivation. Using the 
eigendecompositions Rjx = ^tx-^tx^tx ^nd Q = UxxAgUff, we can write 

tr {Z ((Q (I;v.x + P^TxQrY ® (Q (W + pR^xQ)"')) } 

tr |z(U;x ® Utx) ((Aq (W + pAjxAq)-Y® (Aq (I^tx + pAtxAq)-^)) (U^x ® Uf 
tr I ((U^'x ® Ufx)Z(U;x ® Utx)) 

((Aq (Ijvtx + pAtxAq)-')^ ® (Aq (I^,^ + pAtxAq)-')) } 



'TXJ 



[(Ufx^UT^x)Z(Uk0UTx)](,_i)^^,^,(,.i)^^^^, Xq,,Xc 
^ (1 + pXjx,kXQ,k) (1 + pAtxjAq,;) 



1 ^ ^ [(U^x ^ Ufx)Z(UTx ^ Utx), (,_i)^,,,,(,_i)^,,,, 
-^1^1^ A A • ^^^^ 

P ,=1 ATX,fcATX,/ 

In (a), we used fSO", Lemma 4.2.10] as in (|54|) . In (b), we applied the identity tr{AD} = 
tr {A D} for matrices A and D of appropriate sizes, where D is diagonal. In (c), we made 
use of the fact that only the first N^^i elements on the diagonal of Aq are non-zero. Finally, for 
(d), we note that (U^x ® U|^)Z(Utx ® Ujx) is positive semidefinite. 
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